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Adding and multiplying random matrices: 
a generalization of Voiculescu's formulae 
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Abstract 

In this paper, we give an elementary proof of the additivity of the functional inverses 
of the resolvents of large N random matrices, using recently developed matrix model 
techniques. This proof also gives a very natural generalization of these formulae to the 
case of measures with an external field. A similar approach yields a relation of the same 
type for multiplication of random matrices. 
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1. Introduction 

In the theory of free random variables [|l|, a remarkable additivity property of the 
functional inverses of the spectral resolvents is found, allowing the addition of random 
variables. There is also a similar formula for multiplication of random variables. From 
now on, we shall call them Voiculescu's formulae. These mathematical results have some 
interesting applications: indeed, it turns out that large size (independent) random matrices 
with certain measures are free variables. Therefore it becomes possible to compute the 
resolvent of the sum of two random matrices from the knowledge of the resolvent of the 
separate matrices, i.e. to add (and multipliy) large random matrices. This, in turn, applies 
to various physical situations: "deterministic + random" problem 0|| (noting that for 
gaussian randomness, the addition formula essentially reduces to Pastur's equation ||), 
random matrix methods applied to QCD ||, non-hermitean random matrices |J, the 
Anderson model [[7]. Since in the "planar" large N limit (N size of the matrices) that 
we consider here one cannot compute n-point connected correlations of the eigenvalues or 
other N — > oo subdominant corrections, alternative methods (such as the supersymmetric 
method, see review || and references therein) may be required for a more detailed analysis; 
but for many problems, it is still very important to be able to compute the density of 
eigenvalues (1-point function), which Voiculescu's formulae provide. 

It is therefore of great interest to find an elementary proof of these formulae. We shall 
mention one such proof by Zee of the addition formula, which is based on a perturbative 
approach: the measures of two random hermitean matrices Mi and M2 are assumed to be 
derived from an action of the form tr V(M) where V is a polynomial, and the perturbative 
expansion is represented diagrammatically, leading to a diagrammatic proof of Voiculescu's 
formula. 

However, this proof has limitations. First it assumes ?7(iV)-invariance of the actions. 
Of course one might object that if we assume both measures to be non [/(iV)-invariant, 
then Voiculescu's formula is not true any more (there a is an obvious counter-example, 
which is the case of two fixed matrices). And if one measure is U(N) -invariant and the 
other is not, one can freely replace the non-invariant measure dfi(M) with an invariant 
one by averaging on the unitary group: 
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This replacement will not affect the resolvent of the sum of M and of another random 
matrix with £7(iV)-invariant measure; however, it is not completely innocent since even 
if the original measure d/i(M) was derived from a simple polynomial action, there is no 
reason for dfi e g(M) to possess the same property 

We see that the problem is that this proof does not allow for general enough measures; 
in particular, a very interesting physical application is the case of a fixed matrix (for the 
deterministic + random problem), where the corresponding measure is highly singular (5 
function) and does not fit in this perturbative framework. 

We propose in this paper a new proof of both addition and multiplication formulae, 
which makes very few assumptions on the measures; it is based on recently developed 
matrix model techniques which have been successfully applied to physical models (Kj . 
In section 2, we shall show how to add matrices by introducing an external field in the 
measure (as in H); and in section 3, we shall multiply matrices by adding this time 
a character in the measure (as in [|10|]). The proof has the obvious advantage that it 
generalizes the usual addition formula to the case of a measure with an external field (and 
similarly, the multiplication formula to the case of a measure with a character insertion). 
Section 3 is devoted to a summary of the results and conclusions. 



2. Adding random matrices 

Before adressing the problem of the addition of several matrices, we shall explain 
our approach by considering a single N x N hermitean matrix M with a [/(iV)-invariant 
measure d^u(M). The only assumption we make about this measure is that the diagonal- 
ization of M leads to a saddle point for the eigenvalues of M; that is, after integrating 
out the angular degrees of freedom, the dominant large iV contribution is obtained by 
simply considering the eigenvalues to be fixed (up to a permutation of the eigenvalues). 
This a reasonable assumption, since as iV — > oo, there are only iV eigenvalues, as opposed 
to the N 2 de grees of freedom of the full matrix. For example, a typical measure that is 
encountered in physical problems is: 

d/x(M) = ]JdM u ]Jd Re My dim My exp(-5(M)) (2.1) 

i i<j 
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where S(M) is the action, which is invariant - S(M) = S(ttMtt^) for all E U(N) - and 
scales likes iV 2 as N — > oo, which ensures a saddle point for the eigenvalues^. However, 



the action d/i(M) does not have to be of the form (|2.1| ), and in particular can be more 
sine: ulari. 

We now introduce the partition function with an additional external field A (see 
11| , 12| ,|9[] for the appearance of such an external field in physical models): 



Z(A) = J dfi(M)exp(NtrMA) (2.2) 

where A is a fixed hermitean matrix. When N — > oo, one must consider a sequence of 
N x N matrices A such that their spectral density tends to a continuous density pa{&) on 
the real axis. Since the measure is [/(iV)-invariant Z(A) depends only on the eigenvalues of 
A, and for definiteness, we shall choose A to be diagonal, with eigenvalues aj, j = 1 . . . N. 
We can go over to the eigenvalues Xi of M by using the Itzykson-Zuber-Harish Chan- 



dra formula [13 



where A[-] is the Van der Monde determinant, and d//[Ai] is the resulting measure on the 
eigenvalues; for example, with a measure of the type (|2.1|) , we have: 



*M = / n«,ex P (-S|A^A[A,]^«^M (2.4) 

where we have used the fact that the action S only depends on the eigenvalues Aj of M. 

Finally we introduce the logarithmic derivative of Z with respect to the eigenvalues 
aj. According to (|2.2j ), it is simply expressed as an average 

1 d 



Ndaj 



\ogZ[a j } = (M jj )^ 



where the subscript A indicates that the average is made in the presence of the exter- 
nal field, i.e. with the measure dfi(M) exp(iVtr MA). A more useful expression for this 
logarithmic derivative is found by applying 



Id r , lid det[exp(iVAiaj / 



Ndaj 1 J1 N \ d aj ° A [a 



^Z[aj] = -(^-log 1 f „ ; 3n (2.5) 



.4 



1 For example, S(M) can be of the form: S(M) = N tvV(M), where V is a given polynomial; 
but more general actions with products of traces are possible. 

2 For example, for a fixed matrix, after averaging over the unitary group U(N), the measure 
is a 5 function for the eigenvalues. 



3 



The kind of derivative that appears in (|2.5| ) has been studied in ; we shall briefly review 
the results we need, and refer the reader to the appendix 1 of for the technical details. 
In the large iV limit, the spectral density of A tends by definition to the continuous density 
Pa, and similarly, since there is a saddle point for the eigenvalues of M, we assume that 
the spectral density pu of M becomes also continuous. Then, the derivative with respect 
to dj ( |2.5|) becomes an analytic function f(aj) of its argument aj, of the form: 

/(o) = A(a)-o; A (o) (2.6) 

Let us define the two functions in (ET6|): ua{o) is the resolvent of A: 



1 1 f da'p A (a') 

maw = — tr — ' 



N a — A J a — a' 

It is an analytic function of a except for a cut on the support of A (which is contained in 
the real axis). If we introduce the notation lJja{o) = ^(ua(o> + iO) + ^a{cl — iO)) for a real, 
so that u>a(o> ± iO) = <J)a(o) T ^Pa(o), then 

^K) = ^^-logA[a J ] (2.7) 

Similarly, A(a) is defined by the following requirements: it has the same cut as ua(o>) 
on the support of pa, and it satisfies: 

= (|^-logdet[exp(iVA l a J )]^ (2.8) 

Of course, A(a) may have more cuts than ua(o), whose positions are left undefined; so one 
should really think of A (a) as a multi- valued function, living on a branched covering of the 
complex plane. 

Note that combining (|2.7|) and (|2.8p and using the fact that ua(o>) and A(o) have the 



same cut, one finds the expression (|2.6| ) for the logarithmic derivative ( |2.5|) . 

It is now possible to connect the function A(a) with the resolvent cj^(A) of M: 

a\ / 1 , 1 \ fdX' PM {X') 
wm(A) = - tr- 



N X — M / A J A - A' 



Indeed, it was shown in |Q] (see also the earlier work ||14| ) that if one introduces in a 
symmetric way the function a(A) with the same cut as um(X) and such that 

= ^ (^■^■logdet[exp(NX i a j )]J 



then a(A) and A(a) are functional inverses of each other as multi- valued analytic functions. 

Let us now take the limit A — ► oi, that is pa{o) — > S(a) or still ua{o) — > 
1/a. In this limit, from the Itzykson-Zuber-Harish Chandra formula, one infers that 
det[exp(iVAjaj)]/A[aj] ~ A[Aj], so that a(A) tends to the resolvent wm(A), Therefore, for 
A = 0, A (a) is precisely the functional inverse of the resolvent we were looking for. 

It is now clear that the obvious factorization property 

exp(iVtr(Mi + M 2 )A) = exp(iVtr MiA) exp(iVtr M 2 A) 

implies the additivity of the average of its logarithmic derivative: 

((M x + M 2 ) jj ) A = (M 1 . jj ) A + (M 2 . jj ) A 

On condition that the two matrices M\ and M 2 are independent, this can be rewritten as 
the additivity of the function 

A(o) — u>a(ci) 



or for the particular case A = 0: 



A(a) - - 
a 



This is the essence of Voiculescu's formula for adding random matrices. 

Let us see how this works more explicitly, by considering two independent random 
matrices Mi and M 2 with measures d/ii(Mi) and d/x 2 (M 2 ). We shall assume both measures 
to be {7 (AO-invariant, even though, as explained in the introduction, it is not more difficult 
to prove the formula with only one £7(iV)-invariant measure and a non-invariant one. Both 
measures are such that there exists a saddle point for the eigenvalues of M\ and M 2 . 

Then one introduces the partition function with an external field: 

Z(A) = J J d^ 1 (M 1 )d^ 2 (M 2 )exp(ATtr(M 1 + M 2 )A) (2.9) 

Again, due to J7(A r )-invariance of both measures, Z(A) depends only on the eigenvalues 
a,j of A. Therefore we can write that 

Z[aj}= I dOZ(OAO f ) (2.10) 
Jcieu(N) 



3 Note that if one directly takes A = 0, expressions such as ( |2.7l ) and ( p.8|) become meaningless; 
so one must consider a limit where the support of pa has a finite size but becomes smaller and 
smaller. 
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where we use the normalized Haar measure on U(N). By performing explicitly the inte- 
gration over O (once more, the Itzykson-Zuber-Harish Chandra integral), we immediately 
obtain that 

where the A^ are the eigenvalues of Mi + M 2 . We can now introduce the usual logarithmic 
derivative with respect to a^, which is of the form 

Id f , lid det[ex.p(N XiCtj / 



N daj lOSZl "> ] = N W° S AN A = X ^~^ ^ 

where A(a) is connected with the matrix Mi + M 2 ; in particular, for A = 0, it is the 
functional inverse of the resolvent o;mi+m 2 (^)- 

On the other hand, one can diagonalize separately Mi and M 2 , since the partition 
function completely factorizes as Z(A) = Zi(A)Z2(A) 1 with obvious notations. One finds 

J J A[Ai;iJA[ajJ A[A2;ijA[ajj 

Therefore 

1 d 

iV9a" l0gZ ^^ = ( Xl ( aj ^ ~ w A(aj)) + (A 2 (a i ) -LU A (aj)) (2.12) 



Combining (|2.11[) and (|2.12j ) we find that the relation 

Ai(a) + A 2 (a) = A(a) +u A (a) (2.13) 

is valid on the support of the density of pa (a), and by analytic continuation is therefore 
valid on the whole complex plane. 

For A = 0, the functions A(a), Ai(a), A 2 (a) are functional inverses of the corresponding 
resolvents, and ua(cl) = 1/a, so that ( |2.13|) reduces to Voiculescu's formula for adding 
free variables. However, the relation ( |2.13|) still holds for arbitrary A, thus generalizing 
Voiculescu's formula in a highly non-trivial way. 
Remarks: 

1) if we assume that only one measure (e.g. d^i(Mi)) is [/(iV)-invariant, then Z(A) 
no longer depends only on the eigenvalues of A; but we can take ( |2.10| ) as a definition of 
Z[dj], and then the rest of the proof works identically (except that instead of diagonalizing 
M 2 , one integrates over O). 
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2) In the A = case, the connection to the usual diagrammatic interpretation is the 
following. One can show that for A = 0, our definition of f(a) = A(a) — 1/a is equiva- 
lent to: f(a) = jf-^ log (exp(7VoMn)), where Mn is an arbitrarily chosen diagonal ele- 
ment. (exp(iVaMii)) being a generating function of the moments of Mix, its log generates 
the connected moments: log (exp(iVoMn)) = J2^=o ^T° n (-^n n ) c - Furthermore, using 
[/(iV)-invariance of the measure and both planarity and connectedness of the diagrams 
that appear in the perturbative expansion, one has the following large N equality: 

Therefore one finds the usual expansion f(a) = Yl^=o an ~k (trM n+1 ) c . 



3. Multiplying random matrices 

The same type of argument applies to the multiplication of random matrices. Let us 
start again with a single hermitean random matrix with a measure d^u(M) which leads to 
a saddle point on the eigenvalues. We define the partition function with a character. 

Z[H] = — L- : / d^M)xH(M) (3.1) 



dim if 

Here if is a (holomorphic) irreducible representation of GL(N); it can be parametrized 
in the following way: H = {hj] j = 1 . . . iV}, where the hj, j = 1 . . .N, which form a 
decreasing sequence of integers, are the shifted highest weights of H (the shifted highest 
weights hj are connected with the usual highest weights rrij by the formula: hj = N — j + 
rrij). xh{M) is the character of H taken at M. Using Weyl's formula for the character 
Xh{M) and the fact that dim if = est A[/ij], we can rewrite Z[H] in terms of the hj - . 

^-/^(MJ^gL (3.2, 



This expression is very similar to Eq. ( |2.3|) obtained after use of the Itzykson-Zuber- 
Harish Chandra formula. It is now clear that the same formalism will apply (see appendix 



2 of m and |10[ for more details). 

In the large iV limit, we assume that the hj/N (note the important rescaling of a factor 
of N) tend to a continuous density pH^h). We can then consider the hj/N as continuous 
real variables, and introduce the logarithmic derivatives 

9 , og Z W = / « log Mgl) (3.3) 



dhj to L n \dhj to A[h 




( 



^f- stands for ^^L) 



We are now led to the introduction of two functions: the resolvent un(h) 



u H (h) = 



dh'p H (h') 
h-h' 



and the function L(h) which has the same cut as uJH(h) and whose mean value on it is 



We finally define X(h) = expL(/i). 

The eigenvalues also have a saddle point density pm(A), with its associated resolvent 
<jJm(X), and there is a function h(X) which satisfies 



and which has the same cut as Xum(X). h(X) and X(h) are of course functional inverses of 
each other. Note that we were forced to introduce an extra factor of A in the definition of 
h(X) : which is the crucial difference with the previous section. Indeed, let us now choose 
H to be the trivial representation, so that hi = N — z, that is 



Then det[A^'] = A[A»] and therefore h(X) = Xuj m (X): X(h) is now the functional inverse 
of A times the resolvent, and not of the resolvent itself, which is something completely 
different. 

Let us now write down a formula for multiplying two matrices Mi and M 2 , with 
associated measures dfii(Mi) and d/j2(M 2 ). As before, at least one of the two measures 
must be L r (iV)-invariant. We introduce the partition function with a character: 





u H (h) = log 



h 



h-1 




(3.4) 



Direct application of the previous formalism to the product MiM 2 leads to 



d 



\ogZ[hj] = \ogX{h 3 /N) - u H (hj/N) 



(3.5) 



dhj 



where X(h) is associated to the product M\Mi- 
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On the other hand, since one of the two measures is [/(iV)-invariant, we can write 

that 

Z[hj] = [ d&l [[ d^ 1 (M 1 )d^(M 2 ) X H(fiM 1 0tM 2 ) 

dimii Ja e u(N) J J 

Using orthogonality relations for matrix elements of irreducible representations, we can 
integrate over Q: 



Z[hj] = J J d//i(Mi)d/z 2 (M 2 ) 



XgW Xh{M 2 ) 
dim if dim if 

The logarithmic derivative can now be written as: 

d 



dhj 



log Z[hj] = (logXtihj/^-UHihj/N)) + (logX 2 (h j /N)-u H (h j /N)) (3.6) 



where Xi(h) and A 2 (/i) are the functions associated in the usual way to the matrices Mi 
and M 2 . 

Comparing (|3]5|) and ( |3.6|) and exponentiating the resulting formula, as is more ap- 
propriate for multiplying matrices, we find: 

X 1 (h)X 2 (h) = X(h) exp(u H {h)) (3.7) 

that is the multiplicativity of the function X(h) exp(—UH(h)). 

If we now restrict ourselves to the case of the trivial representation, X(h), Xi(h), A 2 (/i) 
are functional inverses of A times the corresponding resolvents, and un(h) = log(h/ (h — 1)) 
so that 

X 1 (h)X 2 (h) = X(h)-^- (3.8) 
h — 1 

Note once more that the functions A(a) in Eq. ( |2.13| ) and the functions X(h) in Eq. 
( |3.7|) are not directly related to each other since they are expressed in terms of different 
variables. 



4. Conclusion 



We have proven two main formulae: Eq. (|2.13|) for the addition of random matrices, 
and Eq. ( |3.7| ) for their multiplication. As far as the author knows, the second formula, 
even in its usual form (Eq. ( |3.8|) ), does not have a simple diagrammatic proof. 
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The proofs used above have the advantage that they clearly highlight the key hypothe- 
sis needed for the results to hold: i) U(N) -invar iance of (at least one of) the two measures, 
and ii) an analyticity property of the resolvents. Let us discuss these two points. 

The £7(iV)-invariance of the measure is an essential ingredient of the proof: without 
it one cannot integrate over the unitary group to use the Itzykson-Zuber-Harish Chandra 
formula or the orthonality formula for characters. This is completely consistent with the 



assertion found in the mathematical literature [ |15| . [T6| that the two matrices should be 
independently [/(iV)-rotated with respect to each other in order to ensure freeness. As 
has already been mentioned, this hypothesis is obviously necessary (case of two fixed 
matrices); but let us also note that when one keeps the external field A non-zero (or the 
representation H non-trivial), then one obtains addition/multiplication formulae which are 
different from Voiculescu's formulae (and, generically, incompatible with them); so that 
for these measures (which of course also break U{N) -invar iance), the random matrices are 
no more free variables, but still satisfy addition/multiplication formulae. 

The analyticity property of the resolvents stems from the fact that we have assumed 
the matrices to be hermitean, which prevents the eigenvalues from moving freely in the com- 
plex plane, and creating dense regions where the resolvent is no more analytic. However, 
the proof does not really make use of the hermiticity of the matrices, and the generalization 
to non-hermitean matrices might provide some useful insight on these more complicated 
matrix models. 
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